Integrating Logical Operators in Query Expansion in Vector Space Model
Abstract
Query expansion is an effective way to extend the coverage of retrieval to related documents. Various approaches have been proposed, and many of them are based on the vector space model. In these approaches, the expansion process consists of simply adding expansion terms to the original query vector. In this paper we argue that this simplistic expansion method can bias the focus of the original query, because the expansion terms add extra emphasis to the original term they expand. Instead of adding expansion terms to the vector, we propose to combine them with the original terms by means of a logical OR operator. In this way, the expansion terms are treated as alternatives to the original terms, and the focus of the whole query remains unchanged. Our experiments on a TREC collection show that this expansion approach is more effective than the simple addition approach.
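The contrast described in the abstract can be illustrated with a small sketch. The abstract does not say how the OR operator is evaluated, so the code below assumes a fuzzy-logic style OR (the maximum over a term and its alternatives). The function names, the expansion weight of 0.75, and the toy documents are all hypothetical, chosen only for illustration.

```python
# Sketch only: additive expansion versus OR-grouped expansion.
# Assumption: OR is scored as the max over alternatives; the paper's
# actual evaluation of the operator is not given in the abstract.

def additive_expansion(query_terms, expansions, weight=0.75):
    """Add each expansion term to the query vector as an extra dimension."""
    vector = {}
    for term in query_terms:
        vector[term] = vector.get(term, 0.0) + 1.0
        for alt in expansions.get(term, []):
            vector[alt] = vector.get(alt, 0.0) + weight
    return vector


def score_additive(query_vector, doc_vector):
    """Inner product between the expanded query vector and a document."""
    return sum(w * doc_vector.get(t, 0.0) for t, w in query_vector.items())


def or_expansion(query_terms, expansions, weight=0.75):
    """Turn each original term into one group of weighted alternatives."""
    groups = []
    for term in query_terms:
        group = {alt: weight for alt in expansions.get(term, [])}
        group[term] = 1.0  # the original term keeps full weight
        groups.append(group)
    return groups


def score_or(query_groups, doc_vector):
    """Each group contributes only its best-matching alternative."""
    return sum(
        max(w * doc_vector.get(t, 0.0) for t, w in group.items())
        for group in query_groups
    )


# Hypothetical toy data: a two-concept query where only "car" is expanded.
query = ["car", "insurance"]
expansions = {"car": ["automobile", "vehicle"]}

doc_cars_only = {"car": 1.0, "automobile": 1.0, "vehicle": 1.0}
doc_on_topic = {"car": 1.0, "insurance": 1.0}

additive_query = additive_expansion(query, expansions)
or_query = or_expansion(query, expansions)

print("additive:", score_additive(additive_query, doc_cars_only),
      score_additive(additive_query, doc_on_topic))
print("OR:      ", score_or(or_query, doc_cars_only),
      score_or(or_query, doc_on_topic))
```

In this toy setting, the additive query ranks the document that only discusses cars above the document that covers both query concepts, because the three car-related terms accumulate. The OR-grouped query counts the car concept once, so the document matching both concepts ranks first.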
Similar resources
An Improvement in Expert Recommender Systems Using Query Expansion and the Vector Space Model
Due to enormous volume of information available on the Web, finding appropriate knowledge in a short time seems difficult. Knowledge Recommender systems, Online Forums and Question Answering (QA) systems were created to facilitate finding suitable information. QA systems use knowledge repositories to retrieve brief responses to users’ queries. Expert Finding system, not only causes knowledge tr...
Some properties of continuous linear operators in topological vector PN-spaces
The notion of a probabilistic metric space corresponds to the situations when we do not know exactly the distance. Probabilistic metric space was introduced by Karl Menger. Alsina, Schweizer and Sklar gave a general definition of probabilistic normed space based on the definition of Menger [1]. In this note we study the PN spaces which are topological vector spaces and the open mapping an...
A New Model for Phrase Search Based on Weighted Minimum Displacement
Finding high-quality web pages is one of the most important tasks of search engines. The relevance between the documents found and the query searched depends on the user observation and increases the complexity of ranking algorithms. The other issue is that users often explore just the first 10 to 20 results while millions of pages related to a query may exist. So search engines have to use sui...
POINT DERIVATIONS ON BANACH ALGEBRAS OF α-LIPSCHITZ VECTOR-VALUED OPERATORS
The Lipschitz function algebras were first defined in the 1960s by some mathematicians, including Schubert. Initially, the Lipschitz real-value and complex-value functions are defined and quantitative properties of these algebras are investigated. Over time these algebras have been studied and generalized by many mathematicians such as Cao, Zhang, Xu, Weaver, and others. Let be a non-emp...
Rocchio's Model Based on Vector Space Basis Change for Pseudo Relevance Feedback
Rocchio’s relevance feedback model is a classic query expansion method and it has been shown to be effective in boosting information retrieval performance. The main problem with this method is that the relevant and the irrelevant documents overlap in the vector space because they often share the same terms (at least the terms of the query). With respect to the initial vector space basis (
Publication date: 2002